Local model deployment, model quantization, inference optimization, edge deployment

Feeds to Scour
SubscribedAll
Scoured 4937 posts in 370.2 ms
MIT’s new ‘recursive’ framework lets LLMs process 10 million tokens without context rot
venturebeat.com·1d·
💸Affordable LLMs
Preview
Report Post
Planning Poker Integration for Linear 🃏
dev.to·5h·
Discuss: DEV
💸Affordable LLMs
Preview
Report Post
Local models to support home network infrastructure?
news.ycombinator.com·1d·
Discuss: Hacker News
🏠Self-hosting
Preview
Report Post
Vision RAG: Enabling Search on Any Documents
mongodb.com·1d
🔍RAG
Preview
Report Post
From 75% to 99.6%: The Math of LLM Ensembles
shibaprasadb.com·1d·
Discuss: Hacker News
💸Affordable LLMs
Preview
Report Post
The three types of LLM workloads and how to serve them
modal.com·17h·
Discuss: Hacker News
💬Prompt Engineering
Preview
Report Post
featurestorebook/mlfs-book: O'Reilly book - Building Machine Learning Systems with a feature store: batch, real-time, and LLMs
github.com·8h·
Discuss: Hacker News
🦙Ollama
Preview
Report Post
Scaling Interaction, Not Parameters: A Hands-On Guide to MiroThinker 1.5
dev.to·1d·
Discuss: DEV
📱Edge AI
Preview
Report Post
Without benchmarking LLMs, you're likely overpaying 5-10x
karllorey.com·1d·
Discuss: Hacker News
💸Affordable LLMs
Preview
Report Post
Searching the Physical World: Bridging 3D Models and LMMs
spatialview.io·2d·
Discuss: Hacker News
🧲Vector Search & Embeddings
Preview
Report Post
Conversation: LLMs and the what/how loop
martinfowler.com·18h
🔧Code Refactoring Patterns
Preview
Report Post
MLSN #18: Adversarial Diffusion, Activation Oracles, Weird Generalization
lesswrong.com·1d
🛡️AI Security
Preview
Report Post
Evolution of LLMs use by a programmer
asfaload.com·16h·
Discuss: Hacker News
💬Prompt Engineering
Preview
Report Post
A drop-in infrastructure layer for resilient, AI UX
ably.com·1d·
Discuss: DEV
AI-Driven DevOps
Preview
Report Post
Momory: AI Real-Time Stream Subtitles and Translation
momory.dev·4h·
Discuss: Hacker News
📹WebRTC
Preview
Report Post
LLMs Under Siege: The Red Team Reality Check of 2026
eddieoz.com·13h·
Discuss: Hacker News
🛡️AI Security
Preview
Report Post
LLM API Providers Leaderboard - Comparison of over 500 AI Model endpoints
artificialanalysis.ai·4d
💸Affordable LLMs
Preview
Report Post
Everything Moe
ianbarber.blog·1d·
Discuss: Hacker News
📱Edge AI
Preview
Report Post
Can We Build an NX Bit for LLMs
bogdandeac.com·1d·
Discuss: Hacker News
💬Prompt Engineering
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help